Comparing People with Bibliometrics

نویسنده

  • Michael J. Kurtz
چکیده

Bibliometric indicators, citation counts and/or download counts are increasingly being used to inform personnel decisions such as hiring or promotions. These statistics are very often misused. Here we provide a guide to the factors which should be considered when using these so-called quantitative measures to evaluate people. Rules of thumb are given for when begin to use bibliometric measures when comparing otherwise similar candidates. Some things you should know when using bibliometrics to compare people This is a serious business. Careers can be greatly affected by the decisions which flow from bibliometric evaluations. [1,2] It is important to know how accurate these measures are. Measures such as citation or download counts are not important in themselves, they are proxies for evaluating the (research) capabilities of an individual. [1,3,4] While the relative completeness and accuracy of some sources of bibliometric information are frequently discussed, this is not particularly important to comparisons of people. The statistical scatter, even with “perfect” input data in these measures is quite large, and puts limits on any attempt to use the data to evaluate people. [1] Before using citations or downloads to compare individuals these measures must be adjusted to account for the known systematic errors due to age, discipline, and co-authorship. [5] The total number of citations a person’s work receives increases approximately quadratically with the “age” of that individual, where “age” means time as an active researcher. [5,6] The correction is that the square root of the number of citations, divided by “age” is an approximate constant for persons with uniform productivity. For younger researchers determining the effective “age” is problematic, and makes this correction uncertain. [1] Different disciplines, and subdisciplines, have different citation cultures, citation counts cannot be directly compared without correcting for this. [5,7] Multi-author papers are now the norm. Bibliometric measures must take the degree of co-authorship into account. [5,8,9] For comparisons with individuals measured at different times one must account for the steady increase in the number of papers, which double about every 15 years [10,11], and in the number of references per paper, which is experiencing a similar increase [12,13]. The TORI statistic [5], which normalizes citations by both the number of authors on the cited document and the number of references in the citing document, is intended to remove the effects of co-authorship and citation culture. The RIQ statistic [5] is intended to remove the effects of age. Both are available on the ADS metrics pages. Downloads, or reads, are also a useful bibliometric indicator [4,9,14]. Downloads by researchers have very similar properties to, and can predict citations [11,14,15]. Great care must be taken when using download information, however, as downloads by persons who are not researchers have very different properties, and can dominate the statistic [4,7,13]. The ADS metrics pages show the Read10 statistic [1,9], which is the yearly sum of co-author number normalized reads by scientists via ADS of papers written in the preceding ten years. Because the use of internet based services has grown rapidly in the recent past care must be taken when comparing download counts from different time frames. Using samples and measures designed to minimize the systematic errors in the measurements we can determine the intrinsic scatter in the use of bibliometric techniques to evaluate people. We do this by comparing different measures of exactly the same sets of papers by exactly the same people. Because the actual impact of these sets of papers is identical (because they are exactly the same) the differences in impact as measured by different techniques must be due to the intrinsic measurement error in the techniques. Complete details are in [1]. Because citations and reads of people form a log-normal distribution [1,17] the error is in the logarithm of the measurements. For the comparisons using active mid-career astronomers the one standard deviation error is 0.17 dex, a multiplicative factor of 1.48. [1] Citations (and downloads, by extension) are also used to predict performance; essentially this is why they are used in hiring decisions. For individuals five years past the PhD, about the time when assistant professor positions are granted, even using an ​a posteriori ​selected sample, substantially more prescient than any contemporaneously selected sample could be, the one standard deviation error is 0.395 dex, a multiplicative factor of 2.5. [1]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predictive Ranking of Computer Scientists Using CiteSeer Data

The increasing availability of digital libraries with cross-citation data on the Internet enables new studies in bibliometrics. We focus on the list of 10,000 top-cited authors in computer science available as part of CiteSeer. Using data from several consecutive lists, we construct a model of how authors accrue citations with time. By comparing the rate at which individual authors accrue citat...

متن کامل

Bibliometrics to webometrics

Bibliometrics has changed out of all recognition since 1958; becoming established as a field, being taught widely in library and information science schools, and being at the core of a number of science evaluation research groups around the world. This was all made possible by the work of Eugene Garfield and his Science Citation Index. This article reviews the distance that bibliometrics has tr...

متن کامل

The role of conference publications in computer science: a bibliometric view

This contribution gives a bibliometric perspective on the recently strongly discussed topic concerning the role of conferences in computer science. We investigate the frequency and impact of conference publications in computer science, comparing with journal articles. It turns out that, from a bibliometric perspective, the best strategy to gain impact is that of publishing few, final, and well-...

متن کامل

An empirical comparison of topology-based hypertext similarity measures

The ability to accurately estimate the semantic distance between hypertext documents is an essential component in many approaches to overcoming the well-known problem of information overload. For example, an author trying to compile a definitive “hub” index on a particular topic needs to know the similarity between the pages that have already been incorporated into the index, and other pages fr...

متن کامل

Bibliometrics in Online Book Discussions: Lessons for Complex Search Tasks

Online book discussion forums provide rich information on how readers think about and describe books, how books are related to other books and how people search for and recommend books. Within the Social Book Search (SBS) Lab at CLEF we analyse book search requests on the LibraryThing forums and find several types of complex search tasks where bibliometrics naturally combines with information r...

متن کامل

How to calculate the practical significance of citation impact differences? An empirical example from evaluative institutional bibliometrics using adjusted predictions and marginal effects

Evaluative bibliometrics is concerned with comparing research units by using statistical procedures. According to Williams (2012) an empirical study should be concerned with the substantive and practical significance of the findings as well as the sign and statistical significance of effects. In this study we will explain what adjusted predictions and marginal effects are and how useful they ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.09955  شماره 

صفحات  -

تاریخ انتشار 2017